Automatically Identifying the Source Words of Lexical Blends in English

نویسندگان

  • Paul Cook
  • Suzanne Stevenson
چکیده

Newly coined words pose problems for natural language processing systems because they are not in a system’s lexicon, and therefore no lexical information is available for such words. A common way to form new words is lexical blending, as in cosmeceutical, a blend of cosmetic and pharmaceutical. We propose a statistical model for inferring a blend’s source words drawing on observed linguistic properties of blends; these properties are largely based on the recognizability of the source words in a blend. We annotate a set of 1,186 recently coined expressions which includes 515 blends, and evaluate our methods on a 324-item subset. In this first study of novel blends we achieve an accuracy of 40% on the task of inferring a blend’s source words, which corresponds to a reduction in error rate of 39% over an informed baseline. We also give preliminary results showing that our features for source word identification can be used to distinguish blends from other kinds of novel words.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using social media to find English lexical blends 1

We present a method for identifying English lexical blends — words such as complisult (compliment + insult) and globesity (global + obesity) — from social media, specifically Twitter. Our method is based on observations about words and phrases that are commonly used to introduce new words and corpus patterns that are often used to describe the meaning of lexical blends, and leverages the massiv...

متن کامل

Emergent Faithfulness to Proper Nouns in Novel English Blends

Emergent effects (McCarthy & Prince 1994) are the result of phonological constraints or rankings that only reveal themselves in a specific context. That is, they have no discernable effect in the regular phonology of a language but become apparent when speakers perform particular tasks. Crucially, they reveal knowledge that was not learned directly from ambient language data. Emergent effects h...

متن کامل

The Effect of Raising Morphological Decomposition Awareness on Lexical Knowledge of Complex English Words

Lexical knowledge of complex English words is an important part of language skills and crucial for fluent language use. This study aimed to assess the role of morphological decomposition awareness as a vocabulary learning strategy on learners’ productive and receptive recall and recognition of complex English words. University students majoring English at the...

متن کامل

Head Faithfulness in Lexical Blends: a Positional Approach to Blend Formation

KATHERINE SHAW: Head Faithfulness in Lexical Blends: A Positional Approach to Blend Formation (Under the direction of Elliott Moreton) This thesis applies Positional Faithfulness theory (Beckman 1998) to the problem of lexical blending in English. Lexical blends, like brunch or motel, contract multiple source words into a single lexical item shaped by competing sets of phonological and psycholi...

متن کامل

First Language Activation during Second Language Lexical Processing in a Sentential Context

 Lexicalization-patterns, the way words are mapped onto concepts, differ from one language      to another. This study investigated the influence of first language (L1) lexicalization patterns on the processing of second language (L2) words in sentential contexts by both less proficient and more proficient Persian learners of English. The focus was on cases where two different senses of a polys...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Linguistics

دوره 36  شماره 

صفحات  -

تاریخ انتشار 2010